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Abstract 

With the analysis of the hierarchical scheme, the potential influence of the pre-constant in deriving 
scaling laws is exposed. It is found that a modified hierarchical scheme can achieve a throughput 
arbitrarily times higher than the original one, although it is still diminishingly small compared to the 
linear scaling. The study demonstrates the essential importance of the throughput formula itself, rather 
than the scaling laws consequently derived. 
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Scaling-law study of the capacity of wireless networks is a retreat when the exact characteri- 
zation is out of reach. Although it aims at lower goals, it opens an avenue of obtaining concrete 
results. Such results are asymptotic in nature, but can be very insightful especially for networks 
with a large number of nodes. 
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Consider a wireless network of n nodes, where each node is an independent source and wants 



to send information to some other node in the network. What are the achievable rates? For this 
problem, the seminal work [1] showed that the multi-hop operation achieves a scaling law of 
Q(^/n) for the total throughput. That is, on average, each source-destination pair enjoys a rate of 
0(y / n)/n = 0(^), which unfortunately tends to zero as n goes to infinity. This was not good 
news! It implies that no constant rate can be maintained for all source-destination pairs when 
the network size n grows. Obviously, in order to maintain a constant rate, a linear scaling 0(n) 
of the throughput has to be achieved. 

Although the multi-hop operation has indeed been the focus of much protocol development, it 
is well known from multi-user information theory that there are many cooperation schemes that 
can achieve higher rates. Hence, the question remains: Is linear scaling achievable, if based on 
multi-user cooperations? 
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Recently, a hierarchical scheme based on multi-user cooperations was proposed in [2], where, it 
was shown that for any e > 0, the scaling 0(n 1_e ) is achievable under some network conditions. 
This is a significant improvement over the scaling Q(^/n) achieved by the multi-hop operation. 
However, the paper [2] cannot claim that linear scaling is achievable although e can be made 
arbitrarily small, due to the reason that the pre-constant of the scaling is e-dependent and actually 
decreases to zero as e decreases to zero. 

There is a subtle difference between the scaling-law study in [1] and the scaling-law study 
in [2]. In [1], the pre-constant of the scaling is easy to determine due to the fixed link rate in 
the multi-hop operation, which does not change even when the network size grows. However, it 
is not so simple for networking strategies based on multi-user cooperations, which is the case 
in [2]. Unfortunately, in [2], the pre-constant was not addressed. However, negligence of the 
pre-constant results in incomplete pictures, and can even lead to misleading conclusions. 

In [2], different scaling laws were claimed for dense networks (in a fixed area) and extended 
networks (with a fixed density) as the number of nodes goes to infinity. However, any practical 
network is in a fixed area, and with a fixed density. It can either be embedded into a series of 
increasingly denser networks, or a series of increasingly more extended networks. Then, what 
can the two different scaling laws tell about the design and operation of this practical network if 
they are contradicting to each other? Well, the only explanation is that the scaling laws must be 
irrelevant to the design and operation of any practical network that lies in a fixed area and has 
a fixed density. 

Is there anything wrong? Not really, if one takes into account the pre-constant. Consider the 
following simple equation: 

Cin 71 = c 2 n 72 . 

Obviously, for any ^ > 72, we can always find C\ < c 2 for the above equation to hold for 
any n. This actually indicates that the scaling exponent 7 can be made arbitrarily large if the 
pre-constant c is not fixed. 

Therefore, without addressing the pre-constant, the scaling laws claimed in [2] are susceptible 
to the ambiguity indicated above. Indeed, in [2], the way of improving the scaling exponent 
is by increasing the number of hierarchical layers h, such that the corresponding scaling order 
can be arbitrarily close to linear as ^ — > 1. However, the unaddressed pre-constant 
is actually /i-dependent, and decreases to zero as h goes to infinity as demonstrated in [3], [4]. 
That is, the correct and complete expression should be c(/i)nV\ with c(h) — > as h — > 00, 
instead of a single which cannot uncover the whole story. 
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The more careful study [3], [4] of the hierarchical scheme showed that it is not always better 
to choose larger h for any fixed n. Actually, for any n, the optimal h to choose is 

h*(n) = ^/log^/2) (1) 

where (3 is a constant depending on the basic SINR (signal to interference-plus-noise ratio) in the 
network. This implies that any larger h will result in a bigger loss in c(h) compared to the gain 
from n~f^ . It was also shown that with the optimal choice of h and the corresponding optimal 
cluster sizes, the maximum achievable throughput by the hierarchical scheme is 

T*( n ) = (3R (n/2) 1 V log ^ (n/2) (2) 

where R is another constant, also depending on the basic SINR in the network. It can be easily 
checked that 

n 

That is, compared to linear scaling, the throughput achieved by the hierarchical scheme is 
monotonely getting worse as n increases, and the average rate per source-destination pair goes 
to zero. 

One might argue that the scaling exponent 1 — 2 = = in © does converge to 1 as n — > oo, 
and thus, can be replaced by 1 — e for arbitrarily small e > 0, the same as the expression in [2]. 
However, note that this e is n-dependent, and smaller e requires larger n, which in turn magnifies 
the importance of e. This is exactly why T*(ri) becomes arbitrarily times worse than n although 
the exponent does converge to 1. 

But still, does this matter, if it can be claimed that any scaling of Q{n 1 ~ e ) is achievable for any 
fixed e > 0, although the pre-constant is e-dependent, and diminishes to zero as e — > 0? Yes, it 
matters, for practical design and operation of wireless networks, if the scaling law studies intend 
to be insightful or even relevant. First, as explained above, it becomes clear that for any practical 
network, it is not always better to choose more hierarchical layers. More layers do increase the 
exponent, but also introduce more overhead when supporting the hierarchical structure. There 
will be some point, beyond which, the overhead overtakes the benefit of adding more layers. As 
a simple example, for the case where (3 = 10, (QQ) shows that the optimal number of layers for a 
network of 20000 nodes is 2, i.e., the simplest three-phase operation in the hierarchical scheme, 
and the corresponding throughput is c(2) a/20000, which actually is of the same order as that 
offered by the simple multi-hop operation. Then whether to use the hierarchical scheme or the 
multi-hop scheme is completely determined by the pre-constant. 
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Moreover, even if concentrating on the limiting behavior as n — > oo, we will show in this 
paper that a modification of the hierarchical scheme can achieve a throughput T* (n) that can be 
arbitrarily times better than T*(n) in the sense that 

T*(n) 

— . . — > oo, as n — ► oo. 
T*(n) 

Actually, a more careful evaluation shows that 

7T(n) 

— — > oo, for any a > 1. 

i (n) iog a n 

The potential of discovering such more superior schemes may have been ignored if one overlooked 
the importance of e or the pre-constant. 

The remainder of the paper is organized as the following. In Section II, we point out an 
immediate improvement that can be made on the hierarchical scheme proposed in [2], and 
introduce a modification. The throughput analysis and optimization of the modified scheme will 
be carried out in Section III, and will also be compared to the original scheme. In Section IV, 
we discuss the drawbacks associated with the notions of "dense" and "extended" networks, so 
artificially coined for scaling law studies, and propose a unified and direct way of addressing the 
real issues. Finally, some concluding remarks are presented in Section V. 

II. Clustering Multiple-Access with Relay 

We introduce a simple modification to the hierarchical scheme proposed in [2]. The basic 
element in the modification is multiple-access. That is, multiple nodes want to send their inde- 
pendent bits to the same node simultaneously. However, instead of accomplishing this in one 
step, we use a hierarchical structure, where the bits are relayed via multiple levels of clusters 
until reaching the final destination. Before going into the details, let us first examine the scheme 
in [2] to see where improvements can be made. 

The network under study consists of n nodes. There are n source-destination pairs evenly 
distributed, so that each node is a source for some other node, and also is the destination of 
some other source. For convenience, let's call this the original S-D pair problem. In order to 
introduce cooperations, the network is first divided into clusters, each of Mi nodes. 

The basic element in the scheme in [2] is the three-phase operation. That is, first a source node 
distributes its bits to the other nodes in the same cluster (different bits to different nodes); then, 
the source cluster sends all these bits to the destination cluster via the virtual MIMO channel; at 
last, all the nodes in the destination cluster send their quantized observations to the destination 
node. Since all nodes are sources, the first step needs to be carried out M x times for all the 
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nodes in the source cluster, which constitute Phase 1; similarly since all nodes are destinations, 
the last step also needs to be carried out Mi times for all the nodes in the destination cluster, 
which constitute Phase 3; moreover, the second step needs to be carried out n times for n S-D 
pairs, which constitute Phase 2. 

Note that in each cluster of Mi nodes, Phase 1 can actually be decomposed into M\ — l original 
S-D pair problems with non-overlapping destination distributions; and similarly can Phase 3 be 
decomposed. It is exactly this observation which leads to the hierarchical structure proposed in 
[2], where, both Phase 1 and Phase 3 can be replaced by another three-phase operation with 
smaller sub-clusters of size M 2 . Then again, the Phases 1 and 3 of the sub-clusters can be 
replaced by another three-phase operation with even smaller sub-sub-clusters. This process is 
continued, with each Phase 1 or Phase 3 being replaced by a three-phase operation with smaller 
clusters, and the hierarchy is built. 

Our modification arises from a different perspective on Phase 1 and Phase 3. Although they 
can be decomposed into a sequence of the original S-D problems, they are essentially a problem 
where every node wants to send to every other node an independent message. From the receiver 
point of view, each node sees the other nodes trying to send independent messages to it via a 
multiple-access channel. Hence, with this new perspective, in a cluster of Mj nodes, both Phase 1 
and Phase 3 can be carried out by Mj multiple-access operations. Since this is a task where there 
are multiple-accesses to all the nodes, it is convenient to name it the all-way multiple-access 
problem. 

The advantage with this new perspective is that with cluster cooperation, the all-way multiple- 
access problem can be accomplished in two-steps, instead of three. That is, first the nodes in 
any one cluster send their bits to the destination cluster via the virtual MIMO channel; then all 
the nodes in the destination cluster send their quantized observations to the destination node. In 
other words, the first step of one node distributing its bits is not longer necessary, because now 
every node has something to transmit to the same destination. Correspondingly, the hierarchy 
proposed in [2] can be modified as in Fig. [Q Compared to the Figure 3 in [2], the difference 
is the elimination of all the Phase l's from the hierarchy, except on the top layer, where the 
problem is still the original S-D pair problem, which cannot be turned into a multiple-access 
problem. 

As stated in [2], the functionality of Phase 1 is for a node to distribute its bits to the other 
nodes in the cluster, in order to establish a virtual multi-antenna transmitter for the MIMO 
communication in Phase 2. However, in retrospect, since different bits are distributed to different 
nodes, there is essentially no mutual understanding among these nodes when they are transmitting 
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Fig. 1. A modified hierarchical scheme that can achieve a throughput arbitrarily higher than the original one. 

together to the destination cluster. Therefore it may be more accurate to think of Phase 2 as a 
multiple-access communication with a virtual receive cluster. With this in mind, then it becomes 
obvious that Phase 2 can be directly carried out without the preparation of Phase 1 if the problem 
is already multiple-access. 

The same modification with a multiple-access perspective has also appeared in [5] in the 
context of minimizing delay. However, the authors there simply claim that the modified scheme 
achieves the same throughput as the original scheme in [2], largely due to the negligence of the e 
as we explained in the Introduction. In next section, we will show that the modified scheme can 
achieve a throughput arbitrarily higher than the original scheme as the network size n grows. 

III. Analysis of the Scaling Laws 

In this section, we analyze the optimal throughput achievable by the modified hierarchical 
scheme proposed in last section. The procedure is similar to that in [3], [4] when analyzing the 
original hierarchical scheme of [2]. It turns out that the improvement can be arbitrarily times 
large as the network size n grows to infinity, i.e., 

T*(n) 

where, is the optimal throughput by the modified scheme, and T*(n) is the optimal 

throughput by the original scheme. A more careful evaluation even shows that 

— — > oo, for any a > 1. 

T*(n) log a n 

However, still, the average rate per S-D pair goes to zero as n — > oo, i.e., 

n 
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Since the analysis procedure is similar to that in [3], [4], we only highlight the differences 
here. Note that the top layer of the hierarchy remains the same. The key issue is to determine 
the time needed to accomplish the all-way multiple-access problem in Phase 1 and Phase 3 of 
the top layer. 

As defined in last section, the all- way multiple-access problem under study can be stated as 
the following. Consider a network of size M\, where, every node wants to send L bits to every 
other node in the network. (Different bits for different pair, i.e., totally, MfL bits^] need to be 
communicated.) The question is how long it takes to accomplish the task? 

We use the modified two-phase operation scheme to accomplish the task. First, we build the 
hierarchical structure. Divide these Mi nodes into clusters of size M 2 ; then divide each cluster 
of M 2 nodes into smaller clusters of size M 3 ; continue this process h — 2 times for some h > 2, 
and finally we obtain clusters of size M h _\. We will determine the optimal value of h to stop, 
i.e., the optimal number of hierarchical layers, and also the optimal cluster sizes M 2 , M 3 , . . ., 
M h _i in the sequel. 

Obviously, the number of time slots needed to accomplish the all-way multiple-access problem 
with the above hierarchical structure depends on the parameters h, Mi, M 2 , . . ., M h _i, L, and 
therefore, is denoted by 

D h _ 1 (M 1 ,M 2 ,...,M h - 1 ,L). (3) 

We'll use a recurrence relation to determine ©. First, note that the all-way multiple-access 
problem of the network of size Mi is accomplished in two phases: Phase 2 and Phase 3, with 
clusters of size M 2 . The number of time slots needed for Phase 2 is simply ^2Mi^, as calculated 
in [3], [4], where R is the basic rate. In Phase 3, it is again the all-way multiple-access problem 
for networks of smaller size M 2 , but now, with £§ff^ bits to be communicated between each 
pair of nodes. Hence, we have the relation 

A_i(Mi,M 2 ,...,M h _i,L) 

= f 2Mi| + AD h _ 2 {M 2l . . . , M,_i, Lgg) 

where the multiplier 4 is needed for time-sharing between neighboring clusters to avoid excessive 
interference. 

'The accurate number should be Mi (Mi — 1)L. However, for simplicity and without loss of much accuracy when Mi is 
large, we use Mi L in the calculation. This approximation won't affect the scaling order. 
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D h _ 2 (M 2 ,...,M h _ 1 ,L%^) 



M 2 
M 3 



In turn, we have the following relation 

t. 

'RM 2 - 

and similar recursive relations for D h _ 3 , D h _ 4 and so on. Hence, recursively, 

D h _ 1 (M 1 ,M 2 ,...,M h _ 1 ,L) 

= f 2M^ 

+ ■■■ 

+4fc -3^ 2Mfc _ 2 L ( fl )fc -S_^_ 

+4^D 1 (M fc _ 1 ,L(2)*- 2 1 ^ r ). 

For the smallest clusters of size M h _i, the all- way multiple-access problem is accomplished 
directly without the two-phase operation, and thus 

D,{M h ^L{-) _)__(_) _ M,_ r 
Therefore, letting c = 4^, 

Ufc-iCMx, M 2 , . . . , Mh-i, L) = 2Mi— ( — i- + c— ^ + c -— ^ + h c^ 3 -^ + 



i? V M 2 M 3 M 4 M h _i 2 

For any fixed Mi, to minimize the sum in the parenthesis above, noting that the product of 
all those terms is 

i+2+...+(h-2) . Mi = c (h - i)( 2 h - i] . Ml 

2 2 ' 

obviously, the optimal choice is that every term equals to 

o-i)o-2) Mi \ 7rrT h^2 Mi . _j_ 

This leads to the optimal choices of cluster sizes: 

M i = 2c 2 — ( i) — ; 2 < i < /i — 1 (4) 
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and the minimum number of time slots: 

D^_ 1 (M 1 , M 2 , . . . , M h _ u L) = 2M 1 ^(h - 1) C V(^)^. 
Therefore, on the top layer, the number of time slots needed for Phase 1 is 

4x2^^-1)^^; 

the number of time slots needed for Phase 3 is 

~ , r L , , „ , h-i , Mi . i Q 
A x 2M 1 -(h - l)c— (^)^|; 

and the number of time slots needed for Phase 2 is still 2n-|, the same as the original scheme. 
After these time slots, the number of bits transported for each S-D pair is M X L, and the total 
number of bits transported in the whole network is nM x L. Therefore, the throughput is calculated 

as 

• /<*). 



16i(A-l)(l + 2)c^(f)^+2„£ 
It is easy to find the optimal choice of Mi by setting /'(Mi) = 0, and we have 



n = 8(1 + %)c^{^-)~^ or equivalently, M 1 = 2 
R 2 

and the corresponding throughput 



8(l + ^)c-r 



h-l 
h h-l 



n~ (5) 



h(l + R/Q) — (4Q/R) — 
For any fixed n, we can find the optimal h to maximize TJ° p *(n) by setting 

dTTin) _ Q 

This leads to 

h 2 \n{2^Q/R) +h- [ln(n/2) - ln(l + R/Q)} = 0. 
Hence, the optimal number of layers to choose is 



y 'l + 4m(2 v /Q7#)[ln(n/2) - ln(l + R/Q)] - 1 
~ 2H2^/QjR) ' 

Similarly as in [3], [4], in order to obtain a simple formula, we use the approximation 

741n(2 v /QAR)ln(n/2) 

h* = ± — == (6) 

2\n(2^Q/R) 
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which is very accurate for large n. Letting j3\ = 2^/Q/R, we have 



ln(n/2) 



Note that 



Therefore, 



h 



lo gfl, («/2) i VtoT 



T° h p \n) = £ — (™/2)^ 

(n/2) — 



h(l + R/Q)-ir/3£ 

Pl . 1 (n/2) " I35 ^ 75T 

^(l + .R/Q) — 



(8) 



where letting h = h* = */log j3l (n/2), we have the optimal throughput 



r *( n ) = ^ (n/2) v /log ' 3 i ( " /2) (9) 

c n ^log^(n/2) 

where 

c n = (1 + i?/<3) V'^i^ -> (l + i?/Q), asn^oo. 

Obviously, © is very accurate for large n, although we made some approximation in © and h* 
should always be an integer. 

Hence, we arrive at the following theorem. 

Theorem 3.1: With the modified hierarchical scheme, by choosing the optimal number of 
layers as © and the corresponding optimal cluster sizes as © and ©, the optimal throughput 
is given by ©. 

Without the approximation ©, we can also obtain an exact upper bound of the throughput as 
the following. By ©, 

lT(n) < A^(n/2) 1- ^ log ^ ( " /2) 

1 7— 2 
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where, in the last inequality, "=" holds if h = y log /3l (n/2). It is easy to check that with the 
modified hierarchical scheme, the average rate per S-D pair still goes to zero as 

l- 



(n/2) V los ?i (n/2) 1 



-(n/2) V los ^" /2) 



n 2 

2 

2) 



= I (/^ og,3l(n/2) ) V /log ' 9 i (n/2) 

-/5 _2v/los,3l(n/2) 
2 Pl 



0. 



However, the modified scheme can be arbitrarily times better than the original one in [2], as 
can be checked with 



(n / 2 ) V 1 ^^ PU ( n /2) V^ n/ ' 



_ Av^ (n/2) ^ (1 -v^ 



Cn/3 



= A Vlog/3 A ^2(l- A /l^T) A /log fl (n/2) 

— > oo, 

where, T*[n) is the optimal throughput of the original scheme as calculated in [3], [4] with 
/3 = 2a/1 + > and thus, log^/^i < 1. Actually, we can show an even stronger result 
that 

T*(n) 



T*(n)log a n 



oo, for any a > 1, 



since 



oo. 



2(1-^^)^(^/2) 

log/3 ^ga W 

IV. Dense or Sparse Networks? 

The analysis in last section has assumed a fixed basic rate R in order to focus on the scaling 
in terms of n. While this is the case under some channel gain model for the so-called dense 
networks, where networks are confined in a fixed area even as the number of nodes grows to 
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infinity, it is not so easy to maintain a fixed basic rate for networks with growing areas, due to 
the power path loss. 

Therefore, when addressing the so-called extended networks, where the node density is fixed 
while the area grows proportionally to the number of nodes, [2] proposed the trick of concentrating 
the total transmission power into a small portion of the total transmission time to compensate 
for the path loss, so that during that portion, the received SINR is maintained at a specific level. 
Then, with the following power path loss model: 



i.e., the received power P r depends on the transmitted power P t via the transmitter-receiver 
distance d and the path-loss exponent a, the scaling for extended networks readily follows by 
multiplying all the results above with the factor n 1_Q / 2 . Namely, to compensate for the power 
path loss, the transmitted power needs be d a times larger, i.e., (\/n) a times larger considering 
long-hop distances in an area proportional to n. Since the required power level is P/n for dense 
networks, the hierarchical scheme can only be operated in n l ~ a l 2 portion of the time for extended 
networks to satisfy the total power constraint, which leads to the multiplication of the same factor 
to all the scaling law results obtained previously. 

More generally, as pointed out in [3], [4], the same trick can be played on networks with any 
area other than either fixed, or linear growing. That is, a network with area A is distinguished 
into two categories based on whether 



In the case where A a l 2 < n, the basic SINR can be maintained all the time, and the power- 
concentration trick is not needed; In the other case where A a l 2 > n, the power-concentration 
trick is needed to maintain the basic SINR for nj A a l 2 portion of the time, and all the results 
correspondingly need to be multiplied by the same factor. For example, the formula © should 
be modified as 



In [3], [4], these two categories are respectively named as "dense" and "sparse" networks. Note 
that this is a notion that can be readily clarified on any specific network based on the relation 
between the area A and the number of nodes n, different from the previous notion of "dense" and 
"extended" networks that is undetermined for any single network. However, one has to realize 
that this new notion is largely a consequence of the hierarchical scheme, and is also related to 
the path-loss exponent. 



Pr = Pt/d a , 



A a/2 < n. 



(10) 




2 



(ID 



13 



A recent work [6] proposes to address the intermediate regime between dense and extended 
networks by introducing a more general pattern of area scaling as 

A = n u (12) 

where v is a real number, with v — corresponding to dense networks, and v = 1 corresponding 
to extended networks. Although it seems more general by the flexibility of choosing different 
values for u, it is still artificial to make the network area scale according to the pattern (fl2~)) . The 
ambiguity of determining the right embedding process for any specific network still remains, as 
we pointed out in the Introduction. 

Above all, the motivation of studying the capacity of wireless networks is clear: To provide 
insight and guidance on the practical design and operation of such networks. After the explicit 
determination of the pre-constant, it is clear that the throughput formulas such as (fTTI) hold for 
any finite number n, and the scaling laws are just a consequence of letting n — > oo. If the 
practical problem under study is a specific network with a specific area and a specific number of 
nodes, then obviously, it is more natural and insightful to apply the formula CCD) directly rather 
than to consult the scaling laws thereafter derived. So probably, one should not be concerned 
with the scaling laws so much as the exact throughput formula itself. 

Note that in the formula (fTT|) . the parameters R, Q, (3\ also affect the throughput, and they are 
determined by the basic SINR, which in turn, is determined by the long-hop path loss. Therefore, 
for the flexibility of selecting different basic SINRs, the criterion (flOl) should be modified as 

A a/2 < c n, (13) 

and the corresponding optimal throughput is modified as 

2T(n, A) = min {l, & R (n/2)~V^ (14) 

1 Aa/2) 

where, c is a constant, chosen to set the threshold of the basic SINR, and thus the values of the 
parameters R, Q, Pi. Generally, smaller c leads to larger basic rate R; however, smaller c may 
also make the condition (fT3l unsatisfied, and thus lead to the scale-down factor c n/ A a ^ 2 in (fl4l) . 
as a result of the power concentration trick. Hence, there is a basic tradeoff in choosing c when 
maximizing (fl4)) . Apparently, the afore-mentioned notions of "dense" and "sparse" networks 
derived via the criterion (TTOl) are rather arbitrary and scheme-dependent than fundamental. 

In summary, (fl4)) presents the optimal throughput achievable by the modified hierarchical 
scheme for a network of n nodes and area A. This is all we need to know. Based on this, all 
kinds of scaling laws can be derived by setting different limits. Now the question is really how 
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good is CHI), for any possible values of n and A, not just when n — > oo. We have presented a 
simple example in the Introduction showing that this is a question even when only compared to 
the multi-hop scheme. In general, we note that the upper bounds obtained in [7], [8] apply to any 
finite network with specific n and A, and in fact encompass more general traffic patterns with 
the criterion of transport capacity, which allows unequal rates and uneven S-D distributions. 

V. Conclusion 

Caution on the pre-constant is needed when deriving scaling laws for wireless networks, 
especially with multi-user cooperation schemes where the overhead may not be negligible. Based 
on explicit analysis of the pre-constant, we have shown that a modified hierarchical scheme can 
achieve a throughput arbitrarily times higher than the original one, although it is still diminishingly 
lower compared to the linear scaling. This leaves the question open whether it is possible to 
maintain a constant rate between each S-D pair when the number of nodes grows to infinity. 

On the other hand, rather than the scaling laws, we have demonstrated the pivotal importance 
of the throughput formula itself as a function of the network parameters. We emphasize that all 
scaling laws can be derived from this formula, and more importantly, it is this formula that is 
directly related to practice. 
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